Abstract

The goal of this paper is to review some of the main ideas that emerged from the attempts
to confirm mathematically the predictions of the celebrated Parisi ansatz in the Sherrington-
Kirkpatrick model. We try to focus on the big picture while sketching the proofs of only a few
selected results, but an interested reader can find most of the missing details in [31] and [44].

1 Introduction

The Sherrington-Kirkpatrick model. In 1975, Sherrington and Kirkpatrick [37] introduced a
mean field model for a spin glass, a disordered magnetic alloy that exhibits unusual magnetic
behavior. Given a configuration of $N$ Ising spins,

$$\sigma = (\sigma_1, \ldots, \sigma_N) \in \Sigma_N = \{-1, +1\}^N,$$

the Hamiltonian of the model is given by

$$H_N(\sigma) = \frac{1}{\sqrt{N}} \sum_{i,j=1}^{N} g_{ij} \sigma_i \sigma_j, \tag{1}$$

where $(g_{ij})$ are i.i.d. standard Gaussian random variables, collectively called the disorder of the
model. The fact that the distribution of $H_N(\sigma)$ is invariant under the permutations of the coordinates
of $\sigma$ is called the symmetry between sites, which is what one usually understands by a mean field
model. The Hamiltonian (1) is a Gaussian process with the covariance



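$$\mathbb{E} H_N(\sigma^1) H_N(\sigma^2) = \frac{1}{N} \sum_{i,j=1}^{N} \sigma_i^1 \sigma_j^1 \sigma_i^2 \sigma_j^2 = N \Bigl( \frac{1}{N} \sum_{i=1}^{N} \sigma_i^1 \sigma_i^2 \Bigr)^2 = N R_{1,2}^2 \tag{2}$$

that depends on the spin configurations $\sigma^1, \sigma^2$ only through their normalized scalar product

$$R_{1,2} = \frac{1}{N} \, \sigma^1 \cdot \sigma^2 = \frac{1}{N} \sum_{i=1}^{N} \sigma_i^1 \sigma_i^2, \tag{3}$$

called the overlap of $\sigma^1$ and $\sigma^2$. Since the distribution of a Gaussian process is determined by its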
covariance, it is not surprising that the overlaps play a central role in the analysis of the model. 
One can also consider a generalization of the Sherrington-Kirkpatrick model, the so-called mixed 
p-spin model, which corresponds to the Hamiltonian 

$$H_N(\sigma) = \sum_{p \ge 1} \beta_p H_{N,p}(\sigma) \tag{4}$$

given by a linear combination of pure p-spin Hamiltonians

$$H_{N,p}(\sigma) = \frac{1}{N^{(p-1)/2}} \sum_{i_1, \ldots, i_p = 1}^{N} g_{i_1 \ldots i_p} \sigma_{i_1} \cdots \sigma_{i_p}, \tag{5}$$

where the random variables $(g_{i_1 \ldots i_p})$ are standard Gaussian, independent for all $p \ge 1$ and all
$(i_1, \ldots, i_p)$. Similarly to (2), it is easy to check that the covariance is, again, a function of the
overlap,

$$\mathbb{E} H_N(\sigma^1) H_N(\sigma^2) = N \xi(R_{1,2}), \quad \text{where } \xi(x) = \sum_{p \ge 1} \beta_p^2 x^p. \tag{6}$$

One usually assumes that the coefficients $(\beta_p)$ decrease fast enough to ensure that the process is
well defined when the sum in (4) includes infinitely many terms. The model may also include
the external field term $h(\sigma_1 + \ldots + \sigma_N)$ with the external field parameter $h \in \mathbb{R}$. For simplicity of
notation, we will assume that $h = 0$, but all the results hold in the presence of the external field with
some minor modifications. One of the main problems in these models is to understand the behavior
of the ground state energy $\min_{\sigma \in \Sigma_N} H_N(\sigma)$ in the thermodynamic limit $N \to \infty$. In a standard way
this problem can be reduced to the computation of the limit of the free energy

$$F_N = \frac{1}{N} \mathbb{E} \log Z_N, \quad \text{where } Z_N = \sum_{\sigma \in \Sigma_N} \exp(-\beta H_N(\sigma)), \tag{7}$$

for each inverse temperature parameter $\beta = 1/T > 0$, and a formula for this limit was proposed
by Sherrington and Kirkpatrick in [37] based on the so-called replica formalism. At the same
time, they observed that their replica symmetric solution exhibits "unphysical behavior" at low
temperature, which means that it can only be correct at high temperature. Several years later, Parisi
proposed in [32], [33] another, replica symmetry breaking, solution within replica theory, now
called the Parisi ansatz, which was consistent at any temperature $T \ge 0$ and, moreover, was in
excellent agreement with computer simulations. The Parisi formula for the free energy is given by
the following variational principle. 
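
For very small $N$ the free energy (7) can be evaluated by brute-force enumeration of $\Sigma_N$, which gives a quick sanity check on the definitions above. The following Python sketch is only an illustration and is not part of the original argument: the function names, the Monte Carlo average over the disorder and the sample sizes are assumptions made here. It uses the Sherrington-Kirkpatrick Hamiltonian (1); by Jensen's inequality and the covariance (2), the exact value of $F_N$ lies between $\log 2$ and $\log 2 + \beta^2/2$, so an estimate can be compared with these two bounds.

```python
import itertools
import numpy as np


def all_configurations(N):
    """All 2^N spin configurations in Sigma_N = {-1,+1}^N, one per row."""
    return np.array(list(itertools.product([-1.0, 1.0], repeat=N)))


def sk_hamiltonian(g, sigmas):
    """H_N(sigma) = N^{-1/2} sum_{i,j} g_ij sigma_i sigma_j, as in (1), for each row."""
    N = g.shape[0]
    return np.einsum("ci,ij,cj->c", sigmas, g, sigmas) / np.sqrt(N)


def log_partition(g, beta, sigmas):
    """log Z_N from (7), using the log-sum-exp trick for numerical stability."""
    x = -beta * sk_hamiltonian(g, sigmas)
    m = x.max()
    return m + np.log(np.exp(x - m).sum())


def free_energy_estimate(N, beta, n_samples, rng):
    """Monte Carlo estimate of F_N = (1/N) E log Z_N (hypothetical helper).

    The expectation over the disorder is replaced by an average over
    n_samples independent draws of the Gaussian matrix (g_ij).
    """
    sigmas = all_configurations(N)
    logs = [
        log_partition(rng.standard_normal((N, N)), beta, sigmas)
        for _ in range(n_samples)
    ]
    return float(np.mean(logs)) / N
```

A call such as `free_energy_estimate(8, 0.5, 200, np.random.default_rng(0))` enumerates $2^8$ configurations per disorder sample; the cost grows like $2^N$, so this is only feasible for small systems and says nothing about the limit $N \to \infty$.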

The Parisi formula. A basic parameter, called the functional order parameter, is a distribution
function $\zeta$ on $[0, 1]$,

$$\zeta(\{q_p\}) = \zeta_p - \zeta_{p-1} \quad \text{for } p = 0, \ldots, r, \tag{8}$$

corresponding to the choice of $r \ge 1$,

$$0 = \zeta_{-1} < \zeta_0 < \ldots < \zeta_{r-1} < \zeta_r = 1 \tag{9}$$

and

$$0 = q_0 < q_1 < \ldots < q_{r-1} < q_r = 1. \tag{10}$$

Notice that $\zeta$ carries some weight on $r - 1$ points inside the interval $(0, 1)$ and on the points 0 and
1. In general, one can remove the atoms $q_0 = 0$ and $q_r = 1$ and allow $\zeta_0 = 0$ and $\zeta_{r-1} = 1$, but these
cases can be recovered by continuity, so it will be convenient to assume that the inequalities in (9)
are strict. Next, we consider i.i.d. standard Gaussian random variables $(\eta_p)_{0 \le p \le r}$ and define

$$X_r = \log \operatorname{ch} \beta \Bigl( \eta_0 \xi'(0)^{1/2} + \sum_{1 \le p \le r} \eta_p \bigl( \xi'(q_p) - \xi'(q_{p-1}) \bigr)^{1/2} \Bigr). \tag{11}$$

Recursively over $0 \le l \le r - 1$, we define

$$X_l = \frac{1}{\zeta_l} \log \mathbb{E}_l \exp \zeta_l X_{l+1}, \tag{12}$$

where $\mathbb{E}_l$ denotes the expectation with respect to $\eta_{l+1}$ only. Notice that $X_0$ is a function of $\eta_0$.
Finally, we let $\theta(x) = x \xi'(x) - \xi(x)$ and define the so-called Parisi functional by

$$\mathcal{P}(\zeta) = \log 2 + \mathbb{E} X_0 - \frac{1}{2} \sum_{0 \le p \le r-1} \zeta_p \bigl( \theta(q_{p+1}) - \theta(q_p) \bigr). \tag{13}$$

The Parisi solution predicted that the limit of the free energy is equal to

$$\lim_{N \to \infty} F_N = \inf_{\zeta} \mathcal{P}(\zeta), \tag{14}$$

where the infimum is taken over all distribution functions $\zeta$ as above or, in other words, over all
$r \ge 1$ and sequences (9) and (10). The replica method by which the formula (14) was discovered
did not give a definite interpretation of the f.o.p. $\zeta$ or the functional (13), but a clearer picture
emerged in the physics literature (a classical reference is [22]) during the subsequent interpretation
of the Parisi ansatz in terms of some physical properties of the Gibbs measure of the model,

$$G_N(\sigma) = \frac{\exp(-\beta H_N(\sigma))}{Z_N}, \tag{15}$$

where the normalizing factor $Z_N$ defined in (7) is called the partition function. To describe this
picture, let us first explain a modern mathematical framework that is used to encode relevant
information about the model in the thermodynamic limit.
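
The recursion (11)-(13) can be evaluated numerically for small $r$ by nested Gauss-Hermite quadrature. The Python sketch below is a hedged illustration rather than a definitive implementation: the function names and the quadrature size are assumptions made here, and the defaults correspond to the Sherrington-Kirkpatrick case $\xi(x) = x^2$, $\theta(x) = x^2$. One further assumption should be flagged loudly: the last sum is multiplied by $\beta^2$ in the code. This is the scaling under which the one-step value with $\zeta_0 \to 1$ reduces to the annealed value $\log 2 + \beta^2/2$; for $\beta = 1$ it coincides with (13) as written.

```python
import numpy as np


def sk_xi_prime(x):
    """xi'(x) for the SK model, where xi(x) = x^2."""
    return 2.0 * x


def sk_theta(x):
    """theta(x) = x xi'(x) - xi(x) = x^2 for the SK model."""
    return x * x


def parisi_functional(beta, q, zeta, xi_prime=sk_xi_prime, theta=sk_theta, n_nodes=40):
    """Numerical sketch of the Parisi functional for r = len(q) - 1 steps.

    q    = [q_0, ..., q_r]          as in (10)
    zeta = [zeta_0, ..., zeta_{r-1}] as in (9)
    ASSUMPTION: the last term carries a factor beta^2 (see the lead-in).
    The cost is n_nodes^(r+1), so this is only practical for small r.
    """
    r = len(q) - 1
    # Gauss-Hermite nodes/weights for the standard Gaussian density.
    x, w = np.polynomial.hermite_e.hermegauss(n_nodes)
    w = w / np.sqrt(2.0 * np.pi)
    # a[p] is the coefficient of eta_p in (11).
    a = [np.sqrt(xi_prime(q[0]))]
    a += [np.sqrt(xi_prime(q[p]) - xi_prime(q[p - 1])) for p in range(1, r + 1)]

    def X(l, s):
        """X_l as a function of the partial sum s = sum_{p<=l} a_p eta_p."""
        if l == r:
            t = beta * s
            return np.logaddexp(t, -t) - np.log(2.0)  # log ch(t), overflow-safe
        vals = np.array([X(l + 1, s + a[l + 1] * xk) for xk in x])
        return np.log(np.dot(w, np.exp(zeta[l] * vals))) / zeta[l]  # recursion (12)

    EX0 = sum(wk * X(0, a[0] * xk) for xk, wk in zip(x, w))
    last = 0.5 * beta**2 * sum(
        zeta[p] * (theta(q[p + 1]) - theta(q[p])) for p in range(r)
    )
    return float(np.log(2.0) + EX0 - last)
```

For example, `parisi_functional(0.5, [0.0, 1.0], [0.5])` evaluates a one-step functional order parameter with a single jump; minimizing over `q` and `zeta` with a generic optimizer would approximate the infimum in (14), although that step is not attempted here.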

Asymptotic Gibbs measures. Notice that, due to the special covariance structure (6), the distribution
of the Gaussian Hamiltonian (4) is invariant under orthogonal transformations of the set of
spin configurations $\Sigma_N$, which means that, given any orthogonal transformation $U$ on $\mathbb{R}^N$, we have
the equality in distribution

$$\bigl( H_N(U(\sigma)) \bigr)_{\sigma \in \Sigma_N} \stackrel{d}{=} \bigl( H_N(\sigma) \bigr)_{\sigma \in \Sigma_N}.$$

As a result, we are just as interested in the measure $G_N \circ U^{-1}$ on the set $U(\Sigma_N)$ as in the original
Gibbs measure $G_N$. To encode the information about $G_N$ up to orthogonal transformations, let us
consider an i.i.d. sequence $(\sigma^l)_{l \ge 1}$ of replicas sampled from $G_N$ and consider the normalized Gram
matrix of their overlaps

$$R^N = \bigl( R^N_{l,l'} \bigr)_{l,l' \ge 1} = \frac{1}{N} \bigl( \sigma^l \cdot \sigma^{l'} \bigr)_{l,l' \ge 1}. \tag{16}$$

It is easy to see that, given $R^N$, one can reconstruct the Gibbs measure $G_N$ up to orthogonal
transformations, because, every time we observe an entry $R^N_{l,l'}$ equal to 1, it means that the replicas $l$
and $l'$ are equal. This way we can group equal replicas and then use the law of large numbers to
estimate their Gibbs weights from the frequencies of their appearance in the sample. Since the
Gram matrix describes relative position of points in the Euclidean space up to orthogonal
transformations, the overlap matrix $R^N$ can be used to encode the information about the Gibbs measure $G_N$
up to orthogonal transformations. For this reason (and some other reasons that will be mentioned
below), the Gibbs measure in the Sherrington-Kirkpatrick and mixed p-spin models is often
identified with the distribution of the overlap matrix $R^N$. Since the overlaps are bounded in absolute
value by 1, this allows us to pass to the infinite-volume limit and consider a set of all possible
limiting distributions of $R^N$ over subsequences. An infinite array $R$ with any such limiting
distribution inherits two basic properties of $R^N$. First, it is non-negative definite and, second, it satisfies a
"replica symmetry" property

$$\bigl( R_{\pi(l),\pi(l')} \bigr)_{l,l' \ge 1} \stackrel{d}{=} \bigl( R_{l,l'} \bigr)_{l,l' \ge 1} \tag{17}$$

for any permutation $\pi$ of finitely many indices, where the equality is in distribution. Such arrays
are called Gram-de Finetti arrays and the Dovbysh-Sudakov representation [15] (see also [24])
guarantees the existence of a random measure $G$ on the unit ball of a separable Hilbert space $H$
such that

$$\bigl( R_{l,l'} \bigr)_{l \ne l'} \stackrel{d}{=} \bigl( \sigma^l \cdot \sigma^{l'} \bigr)_{l \ne l'}, \tag{18}$$

where $(\sigma^l)$ is an i.i.d. sequence of replicas sampled from the measure $G$. We will call such a
measure $G$ an asymptotic Gibbs measure and think of it as a limit of the Gibbs measures $G_N$ over
some subsequence, where the convergence is defined by way of the overlap arrays, as above. The
diagonal elements are not included in (18) because in (16) they were equal to
1 by construction, while the asymptotic Gibbs measure is not necessarily concentrated on the unit
sphere. This mathematical definition of an asymptotic Gibbs measure via the Dovbysh-Sudakov 
representation was first given by Arguin and Aizenman in Q. We will now describe (reinterpret) 
various predictions of the Parisi ansatz in the language of these asymptotic Gibbs measures. 
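
The passage from the Gibbs measure to the overlap array can be made concrete for a small system. The following Python sketch is an illustration under stated assumptions, not a construction from the paper: it enumerates $\Sigma_N$ for a small $N$, builds the Gibbs measure (15) for the Hamiltonian (1), samples finitely many replicas and forms the corresponding block of the overlap matrix (16). All function names are hypothetical.

```python
import itertools
import numpy as np


def gibbs_measure(g, beta):
    """Configurations of Sigma_N and their Gibbs weights (15) for the SK Hamiltonian (1)."""
    N = g.shape[0]
    sigmas = np.array(list(itertools.product([-1.0, 1.0], repeat=N)))
    H = np.einsum("ci,ij,cj->c", sigmas, g, sigmas) / np.sqrt(N)
    x = -beta * H
    p = np.exp(x - x.max())  # subtract the maximum to avoid overflow
    return sigmas, p / p.sum()


def sample_replicas(sigmas, weights, n_replicas, rng):
    """Draw i.i.d. replicas from the Gibbs measure; returns indices and configurations."""
    idx = rng.choice(len(sigmas), size=n_replicas, p=weights)
    return idx, sigmas[idx]


def overlap_matrix(replicas):
    """Finite block of the normalized Gram matrix (16): R_{l,l'} = sigma^l . sigma^l' / N."""
    N = replicas.shape[1]
    return replicas @ replicas.T / N


def empirical_weights(idx, n_configs):
    """Frequencies of the sampled configurations (law of large numbers estimate)."""
    return np.bincount(idx, minlength=n_configs) / len(idx)
```

With these helpers, an entry of the overlap matrix equals 1 exactly when the two replicas coincide, and the frequencies returned by `empirical_weights` approach the Gibbs weights as the number of replicas grows, which is the reconstruction argument described above.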

Order parameter and pure states. First of all, in the work of Parisi, [34], the functional order
parameter $\zeta$ in (8) was identified with the distribution of the overlap $R_{1,2}$ under the average
(asymptotic) Gibbs measure,

$$\zeta(A) = \mathbb{E} G^{\otimes 2} \bigl( (\sigma^1, \sigma^2) : R_{1,2} = \sigma^1 \cdot \sigma^2 \in A \bigr), \tag{19}$$

and the infimum in the Parisi formula (14) is taken over all possible candidates for this distribution
in the thermodynamic limit. The fact that the infimum in (14) is taken over discrete distributions
$\zeta$ is not critical, since the definition of the Parisi functional $\mathcal{P}(\zeta)$ can be extended to all
distributions on $[0, 1]$ by continuity. Another important idea introduced in [34] was the decomposition
of the Gibbs measure into pure states. This simply means that an asymptotic Gibbs measure $G$
is concentrated on countably many points $(h_l)_{l \ge 1}$ in the Hilbert space $H$, and these are precisely
the pure states. It was also suggested in [34] that it is reasonable to assume that all the pure states
have equal norm, so the asymptotic Gibbs measure $G$ is concentrated on some non-random sphere,
$G(\|h\| = c) = 1$. This implies, for example, that the largest value the overlap can take is $R_{1,2} = c^2$
when the replicas $\sigma^1 = \sigma^2 = h_l$ for some $l \ge 1$ and, since this can happen with positive
probability, the distribution of the overlap has an atom at the largest point of its support, $\zeta(\{c^2\}) > 0$. We
will see below that, due to some stability properties of the Gibbs measure, the pure state picture is
correct if the distribution of the overlap has an atom at the largest point $c^2$ of its support; otherwise,
the measure $G$ is non-atomic, but is still concentrated on the sphere $\|h\| = c$.

Ultrametricity. Perhaps the most famous feature of the Parisi solution of the SK model in [32],
[33] was the choice of an ultrametric parametrization of the replica matrix in the replica method,
and in the work of Mezard, Parisi, Sourlas, Toulouse and Virasoro [20], [21], this was interpreted
as the ultrametricity of the support of the asymptotic Gibbs measure $G$ in $H$, which means that
the distances between any three points in the support satisfy the strong triangle, or ultrametric,
inequality

$$\|\sigma^2 - \sigma^3\| \le \max\bigl( \|\sigma^1 - \sigma^2\|, \|\sigma^1 - \sigma^3\| \bigr). \tag{20}$$

When the Gibbs measure is concentrated on the sphere $\|h\| = c$, we can express the distance in
terms of the overlap, $\|\sigma^1 - \sigma^2\|^2 = 2(c^2 - R_{1,2})$, and, therefore, the ultrametricity can also be
expressed in terms of the overlaps,

$$R_{2,3} \ge \min(R_{1,2}, R_{1,3}). \tag{21}$$

One can think about ultrametricity as clustering of the support of $G$, because the ultrametric
inequality (20) implies that the relation defined by the condition

$$\sigma^1 \sim \sigma^2 \iff \|\sigma^1 - \sigma^2\| \le d \tag{22}$$

is an equivalence relation on the support of $G$ for any $d \ge 0$. As we increase $d$, smaller clusters
will collapse into bigger clusters and the whole process can be visualized by a branching tree. For
a given diameter $d \ge 0$, one can consider the equivalence clusters and study the joint distribution
of their Gibbs weights. In the case when $d = 0$, which corresponds to the weights of the pure
states $(G(\{h_l\}))_{l \ge 1}$, this distribution was characterized in [20] using the replica method, but the
same computation works for any $d \ge 0$. More generally, one can consider several cluster sizes
$d_1 < \ldots < d_r$ and for each pure state $h_l$ consider the weights of the clusters it belongs to,

$$G(\|\sigma - h_l\| \le d_1), \ldots, G(\|\sigma - h_l\| \le d_r).$$

Again, using the replica method within the Parisi ansatz, one can study the joint distribution of
all these weights for all pure states, but the computation gets very complicated and not
particularly illuminating. On the other hand, the problem of understanding the distribution of the cluster
weights is very important, since this gives, in some sense, a complete description of the
asymptotic Gibbs measure $G$. Fortunately, a much more explicit and useful description of the asymptotic
Gibbs measures arose from the study of some related toy models.
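
The ultrametric inequality (21) and the clustering relation (22) are easy to test on a finite matrix of overlaps. The following Python sketch is an illustration only, with hypothetical function names. It works directly with overlaps instead of distances, so a cluster of diameter $d$ in (22) corresponds to the overlap threshold $q = c^2 - d^2/2$ on the sphere $\|h\| = c$; the diagonal of the matrix is assumed to hold the common value $c^2$.

```python
import numpy as np


def is_ultrametric(R, tol=1e-12):
    """Check R[b, c] >= min(R[a, b], R[a, c]) for all triples, as in (21)."""
    n = R.shape[0]
    for a in range(n):
        # m[b, c] = min(R[a, b], R[a, c]) for the fixed index a.
        m = np.minimum(R[a][:, None], R[a][None, :])
        if np.any(R < m - tol):
            return False
    return True


def clusters(R, q, tol=1e-12):
    """Label the classes of the relation 'overlap >= q'.

    For an ultrametric matrix this relation is an equivalence relation,
    so a single greedy pass assigns every point to exactly one class.
    """
    n = R.shape[0]
    labels = -np.ones(n, dtype=int)
    k = 0
    for i in range(n):
        if labels[i] < 0:
            labels[(R[i] >= q - tol) & (labels < 0)] = k
            k += 1
    return labels
```

Lowering the threshold `q` (that is, increasing the diameter $d$) merges clusters, and recording the labels for a decreasing sequence of thresholds recovers the branching tree described above.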
Derrida's random energy models. In the early eighties, Derrida proposed two simplified models
of spin glasses: the random energy model (REM) in [10], [11], and the generalized random energy
model (GREM) in [12], [13]. The Hamiltonian of the REM is given by a vector $(H_N(\sigma))_{\sigma \in \Sigma_N}$
of independent Gaussian random variables with variance $N$, which is a rather classical object. The
GREM combines several random energy models in a hierarchical way with the ultrametric
structure built into the model from the beginning. Even though these simplified models do not shed
light on the Parisi ansatz in the SK model directly, the behavior of the Gibbs measures in these
models was predicted to be, in some sense, identical to that of the SK model. For example, Derrida
and Toulouse showed in [14] that the Gibbs weights in the REM have the same distribution in the
thermodynamic limit as the Gibbs weights of the pure states in the SK model, described in [20],
and de Dominicis and Hilhorst flU demonstrated a similar connection between the distribution of 
the cluster weights in the GREM and the cluster weights in the SK model. Motivated by this
connection with the SK model, in a seminal paper [36], Ruelle gave an alternative, much more explicit
and illuminating, description of the Gibbs measure of the GREM in the infinite-volume limit in
terms of a certain family of Poisson processes, as follows.

The Ruelle probability cascades. The points and weights of these measures will be indexed by
$\mathbb{N}^r$ for some fixed $r \ge 1$. It will be very convenient to think of $\mathbb{N}^r$ as the set of leaves of a rooted
tree (see Fig. 1) with the vertex set

$$\mathcal{A} = \mathbb{N}^0 \cup \mathbb{N} \cup \mathbb{N}^2 \cup \ldots \cup \mathbb{N}^r, \tag{23}$$

where $\mathbb{N}^0 = \{\emptyset\}$, $\emptyset$ is the root of the tree and each vertex $\alpha = (n_1, \ldots, n_p) \in \mathbb{N}^p$ for $p \le r - 1$ has
children

$$\alpha n := (n_1, \ldots, n_p, n) \in \mathbb{N}^{p+1}$$

for all $n \in \mathbb{N}$. Therefore, each vertex $\alpha$ is connected to the root $\emptyset$ by the path

$$\emptyset \to n_1 \to (n_1, n_2) \to \cdots \to (n_1, \ldots, n_p) = \alpha.$$

We will denote all the vertices in this path by (the root is not included)

$$p(\alpha) = \bigl\{ n_1, (n_1, n_2), \ldots, (n_1, \ldots, n_p) \bigr\}. \tag{24}$$

The identification of the index set $\mathbb{N}^r$ with the leaves of this infinitary tree is very important,
because, even though the points in the support of the random measure will be indexed by $\alpha \in \mathbb{N}^r$,
the construction itself will involve random variables indexed by vertices of the entire tree. For
each vertex $\alpha \in \mathcal{A}$, let us denote by $|\alpha|$ its distance from the root of the tree $\emptyset$, or, equivalently, the
number of coordinates in $\alpha$, i.e. $\alpha \in \mathbb{N}^{|\alpha|}$. If we recall the parameters (9) then, for each $\alpha \in \mathcal{A} \setminus \mathbb{N}^r$,
let $\Pi_\alpha$ be a Poisson process on $(0, \infty)$ with the mean measure

$$\zeta_{|\alpha|} x^{-1 - \zeta_{|\alpha|}} \, dx \tag{25}$$

and let us generate these processes independently for all such $\alpha$. Let us recall that each Poisson
process $\Pi_\alpha$ can be generated by partitioning $(0, \infty) = \bigcup_{m \ge 1} S_m$ into disjoint sets $S_1 = [1, \infty)$ and
$S_m = [1/m, 1/(m-1))$ for $m \ge 2$ and then on each set $S_m$ generating independently a Poisson
number of points with the mean

$$\int_{S_m} \zeta_{|\alpha|} x^{-1 - \zeta_{|\alpha|}} \, dx$$

from the probability distribution on $S_m$ proportional to (25).

Figure 1: The leaves $\alpha \in \mathbb{N}^r$ index the pure states. The rightmost path is an example of $p(\alpha)$ in
(24) for one leaf $\alpha$. The figure corresponds to what is called "r-step replica symmetry breaking"
in the Parisi ansatz.

Let us mention that, for technical
reasons, it is important that the parameters $\zeta_{|\alpha|}$ in this construction are strictly between 0 and 1,
which is why we assumed that the inequalities in (9) are strict. One can arrange all the points in
$\Pi_\alpha$ in the decreasing order,

$$u_{\alpha 1} > u_{\alpha 2} > \ldots > u_{\alpha n} > \ldots, \tag{26}$$

and enumerate them using the children $(\alpha n)_{n \ge 1}$ of the vertex $\alpha$. In other words, parent vertices
$\alpha \in \mathcal{A} \setminus \mathbb{N}^r$ enumerate independent Poisson processes $\Pi_\alpha$ and child vertices $\alpha n \in \mathcal{A} \setminus \mathbb{N}^0$ enumerate
individual points $u_{\alpha n}$. Given a vertex $\alpha \in \mathcal{A} \setminus \mathbb{N}^0$ and the path $p(\alpha)$ in (24), we define



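$$w_\alpha = \prod_{\beta \in p(\alpha)} u_\beta. \tag{27}$$

Finally, for the leaf vertices $\alpha \in \mathbb{N}^r$ we define

$$v_\alpha = \frac{w_\alpha}{\sum_{\beta \in \mathbb{N}^r} w_\beta}. \tag{28}$$

One can show that the denominator is finite with probability one, so this sequence is well defined.
Now, let $e_\alpha$ for $\alpha \in \mathcal{A} \setminus \mathbb{N}^0$ be some sequence of orthonormal vectors in $H$. Given this sequence,
we consider a set of points $h_\alpha \in H$ indexed by $\alpha \in \mathbb{N}^r$,

$$h_\alpha = \sum_{\beta \in p(\alpha)} e_\beta \bigl( q_{|\beta|} - q_{|\beta|-1} \bigr)^{1/2}, \tag{29}$$

where the parameters $(q_p)_{0 \le p \le r}$ were introduced in (10). In other words, as we walk along the
path $p(\alpha)$ to the leaf $\alpha \in \mathbb{N}^r$, at each step $\beta$ we add a vector in the new orthogonal direction $e_\beta$ of
length $\sqrt{q_{|\beta|} - q_{|\beta|-1}}$. We define a random measure $G$ on the Hilbert space $H$ by

$$G(h_\alpha) = v_\alpha \quad \text{for } \alpha \in \mathbb{N}^r. \tag{30}$$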

The measure $G$ is called the Ruelle probability cascades (RPC) associated to the parameters (9)
and (10). From the definition (29), it is clear that the scalar product $h_\alpha \cdot h_\beta$ between any two points
in the support of $G$ depends only on the number

$$\alpha \wedge \beta := |p(\alpha) \cap p(\beta)| \tag{31}$$

of common vertices in the paths from the root $\emptyset$ to the leaves $\alpha, \beta \in \mathbb{N}^r$. With this notation, (29)
implies that $h_\alpha \cdot h_\beta = q_{\alpha \wedge \beta}$. Now, if we take three leaves $\alpha, \beta, \gamma \in \mathbb{N}^r$ then their paths satisfy

$$\beta \wedge \gamma \ge \min(\alpha \wedge \beta, \alpha \wedge \gamma),$$

since the vertices shared by the path $p(\alpha)$ with both paths $p(\beta)$ and $p(\gamma)$ will also be shared by
$p(\beta)$ and $p(\gamma)$ and, therefore,

$$h_\beta \cdot h_\gamma \ge \min(h_\alpha \cdot h_\beta, h_\alpha \cdot h_\gamma), \tag{32}$$

so the support of $G$ is ultrametric in $H$ by construction. In the work of Ruelle, [36], it was stated
as an almost evident fact that the Gibbs measure in the Derrida GREM looks like the measure (30)
in the infinite-volume limit, but a detailed proof of this was given later by Bovier and Kurkova in
O. Because of the connection to the SK model mentioned above, the Ruelle probability cascades 
are precisely the measures that were expected to describe the Gibbs measures in the SK model
in the sense that, asymptotically, the overlap array (16) can be approximated in distribution by an
overlap array generated by some RPC. The points $(h_\alpha)_{\alpha \in \mathbb{N}^r}$ are the pure states and the tree $\mathcal{A}$ can
be viewed as a branching tree that indexes the clusters around all the pure states. One can show that
the distribution (19) of the overlap of two replicas sampled from the Ruelle probability cascades is
equal to the distribution function in (8), which agrees with the Parisi interpretation of the functional
order parameter $\zeta$.
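
A truncated version of the Ruelle probability cascades can be simulated directly from the construction (25)-(31). The Python sketch below is an approximation under stated assumptions, not the construction used in the paper. Only the $K$ largest points of each Poisson process are kept (the exact cascade has infinitely many children per vertex), and each process is generated through the standard representation $u_{\alpha n} = \Gamma_n^{-1/\zeta_{|\alpha|}}$, where $\Gamma_1 < \Gamma_2 < \ldots$ are the arrival times of a rate-one Poisson process, instead of the partition into the sets $S_m$. The function names and the parameter `K` are hypothetical.

```python
import itertools
import numpy as np


def sample_rpc(zeta, q, K, rng):
    """Truncated Ruelle probability cascade.

    zeta = [zeta_0, ..., zeta_{r-1}] and q = [q_0, ..., q_r] as in (9), (10).
    Returns the leaves (tuples in {0..K-1}^r) and their weights v_alpha (28).
    ASSUMPTION: each vertex keeps only its K largest points.
    """
    r = len(zeta)
    u = []
    for p in range(r):
        # Arrival times of independent rate-one Poisson processes, one per parent
        # vertex of depth p; the last axis enumerates the children in the order (26).
        gammas = np.cumsum(rng.exponential(size=(K,) * (p + 1)), axis=-1)
        u.append(gammas ** (-1.0 / zeta[p]))  # points of Pi_alpha, decreasing in n
    leaves = list(itertools.product(range(K), repeat=r))
    # w_alpha is the product of the points along the path p(alpha), as in (27).
    w = np.array([np.prod([u[p][leaf[: p + 1]] for p in range(r)]) for leaf in leaves])
    return leaves, w / w.sum()


def common_vertices(a, b):
    """alpha ^ beta from (31): number of shared vertices on the two root-to-leaf paths."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def rpc_overlaps(leaves, q):
    """Matrix of scalar products h_alpha . h_beta = q_{alpha ^ beta}."""
    return np.array([[q[common_vertices(a, b)] for b in leaves] for a in leaves])
```

Sampling pairs of leaves with probabilities `v` and reading off the corresponding entries of `rpc_overlaps` gives draws of the overlap $R_{1,2}$; for large `K` their empirical distribution should be close to (8), although the truncation error is not quantified in this sketch.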

Such an explicit description of the expected asymptotic Gibbs measures was a very big step,
because one could now study their properties using the entire arsenal of the theory of Poisson
processes ([19]). Some important properties of the Ruelle probability cascades were already described
in the original paper of Ruelle [36], while other important properties, which express certain
invariance features of these measures, were discovered later by Bolthausen and Sznitman in [5]. We will
mention this again below when we talk about the unified stability property in the SK model. In
the next few sections we will explain that all the predictions of the physicists about the structure
of the Gibbs measure in the SK and mixed p-spin models are, essentially, correct. In general, they
hold under a small perturbation of the Hamiltonian, which does not affect the free energy in the
infinite-volume limit, but for a class of the so-called generic mixed p-spin models they hold
precisely, without any perturbation. First, we will explain the connection between the Gibbs measure
and the Parisi formula for the free energy.

2 Free energy and Gibbs measure 

The Aizenman-Sims-Starr scheme. Before we describe rigorous results about the structure of
the Gibbs measure, let us explain how this structure implies the Parisi formula for the free energy
(14). For simplicity of notation, we will focus on the Sherrington-Kirkpatrick model (1) instead
of the general mixed p-spin model (4). We begin with the so-called Aizenman-Sims-Starr cavity
computation, which was introduced in 0. Let us recall the definition of the partition function $Z_N$
in (7) and for $j \ge 0$ let us denote

$$A_j = \mathbb{E} \log Z_{j+1} - \mathbb{E} \log Z_j, \tag{33}$$

with the convention that $Z_0 = 1$. Then we can rewrite the free energy as follows,

$$F_N = \frac{1}{N} \mathbb{E} \log Z_N = \frac{1}{N} \sum_{j=0}^{N-1} A_j. \tag{34}$$

Clearly, this representation implies that if the sequence $A_N$ converges then its limit is also the limit
of the free energy $F_N$. Unfortunately, it is usually difficult to prove that the limit of $A_N$ exists (we
will mention one such result at the end, when we talk about generic mixed p-spin models) and,
therefore, this representation is used only to obtain a lower bound on the free energy,

$$\liminf_{N \to \infty} F_N \ge \liminf_{N \to \infty} A_N. \tag{35}$$

Let us compare the partition functions $Z_N$ and $Z_{N+1}$ and see what they have in common and what
makes them different. If we denote $\rho = (\sigma, \varepsilon) \in \Sigma_{N+1}$ for $\sigma \in \Sigma_N$ and $\varepsilon \in \{-1, +1\}$ then we can
write

$$H_{N+1}(\rho) = H'_N(\sigma) + \varepsilon z_N(\sigma), \tag{36}$$

where

$$H'_N(\sigma) = \frac{1}{\sqrt{N+1}} \sum_{i,j=1}^{N} g_{ij} \sigma_i \sigma_j \tag{37}$$

and

$$z_N(\sigma) = \frac{1}{\sqrt{N+1}} \sum_{i=1}^{N} \bigl( g_{i(N+1)} + g_{(N+1)i} \bigr) \sigma_i. \tag{38}$$

On the other hand, the part (37) of the Hamiltonian $H_{N+1}(\rho)$ is, in some sense, also a part of the
Hamiltonian $H_N(\sigma)$ since, in distribution, the Gaussian process $H_N(\sigma)$ can be decomposed into a
sum of two independent Gaussian processes

$$H_N(\sigma) = H'_N(\sigma) + y_N(\sigma), \tag{39}$$

where

$$y_N(\sigma) = \frac{1}{\sqrt{N(N+1)}} \sum_{i,j=1}^{N} g'_{ij} \sigma_i \sigma_j \tag{40}$$

for some independent array $(g'_{ij})$ of standard Gaussian random variables. Using the above
decompositions (36) and (39), we can write

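$$\mathbb{E} \log Z_{N+1} = \mathbb{E} \log \sum_{\sigma \in \Sigma_N} 2 \operatorname{ch}(-\beta z_N(\sigma)) \exp(-\beta H'_N(\sigma)) \tag{41}$$

and

$$\mathbb{E} \log Z_N = \mathbb{E} \log \sum_{\sigma \in \Sigma_N} \exp(-\beta y_N(\sigma)) \exp(-\beta H'_N(\sigma)). \tag{42}$$

Finally, if we consider the Gibbs measure on $\Sigma_N$ corresponding to the Hamiltonian $H'_N(\sigma)$ in (37),

$$G'_N(\sigma) = \frac{\exp(-\beta H'_N(\sigma))}{Z'_N}, \quad \text{where } Z'_N = \sum_{\sigma \in \Sigma_N} \exp(-\beta H'_N(\sigma)), \tag{43}$$

then (41), (42) can be combined to give the Aizenman-Sims-Starr representation,

$$A_N = \mathbb{E} \log \sum_{\sigma \in \Sigma_N} 2 \operatorname{ch}(-\beta z_N(\sigma)) G'_N(\sigma) - \mathbb{E} \log \sum_{\sigma \in \Sigma_N} \exp(-\beta y_N(\sigma)) G'_N(\sigma). \tag{44}$$

Notice that the Gaussian processes $(z_N(\sigma))$ and $(y_N(\sigma))$ are independent of the randomness of the
measure $G'_N$, and have the covariance

$$\mathbb{E} z_N(\sigma^1) z_N(\sigma^2) = 2 R_{1,2} + O(N^{-1}), \quad \mathbb{E} y_N(\sigma^1) y_N(\sigma^2) = R_{1,2}^2 + O(N^{-1}). \tag{45}$$

Suppose that we replace the Gibbs measure $G'_N$ in (44) by the Ruelle probability cascades $G$ in
(30) and replace the Gaussian processes $(z_N(\sigma))$ and $(y_N(\sigma))$ by Gaussian processes $(z(h_\alpha))$ and
$(y(h_\alpha))$ indexed by the points $(h_\alpha)_{\alpha \in \mathbb{N}^r}$ in the support of $G$ with the same covariance structure as
(45),

$$\mathbb{E} z(h_\alpha) z(h_\beta) = 2 h_\alpha \cdot h_\beta, \quad \mathbb{E} y(h_\alpha) y(h_\beta) = (h_\alpha \cdot h_\beta)^2. \tag{46}$$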

Such processes are very easy to construct explicitly, if we recall the definition of the points $h_\alpha$ in
(29). Namely, let $(\eta_\alpha)_{\alpha \in \mathcal{A} \setminus \mathbb{N}^0}$ be a sequence of i.i.d. standard Gaussian random variables and, for
each $p \ge 1$, let us define a family of Gaussian random variables indexed by $(h_\alpha)_{\alpha \in \mathbb{N}^r}$,

$$g_p(h_\alpha) = \sum_{\beta \in p(\alpha)} \eta_\beta \bigl( q_{|\beta|}^p - q_{|\beta|-1}^p \bigr)^{1/2}. \tag{47}$$

Recalling the notation (31), it is obvious that the covariance of this process is

$$\mathbb{E} g_p(h_\alpha) g_p(h_\beta) = q_{\alpha \wedge \beta}^p = (h_\alpha \cdot h_\beta)^p, \tag{48}$$

so we can take $z(h_\alpha) = \sqrt{2} g_1(h_\alpha)$ and $y(h_\alpha) = g_2(h_\alpha)$. Then the functional (44) will be replaced
by

$$\mathcal{P}(\zeta) = \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} 2 \operatorname{ch}(-\beta z(h_\alpha)) v_\alpha - \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} \exp(-\beta y(h_\alpha)) v_\alpha. \tag{49}$$

Writing $\mathcal{P}(\zeta)$ here is not an abuse of notation, since it turns out that the right hand side coincides
with the Parisi functional in (13) when $\xi(x) = x^2$ and $\theta(x) = x^2$, which is precisely the case of the
Sherrington-Kirkpatrick model. The equality of these two different representations can be proved
using the properties of the Poisson processes with the mean measures (25) that appear in the
definition of the Ruelle probability cascades, and (49) gives a very natural interpretation of the Parisi
functional $\mathcal{P}(\zeta)$ in (13). It remains to explain that, if we assume the Parisi ansatz for the Gibbs
measure, then the connection between (44) and (49) is more than just a formal resemblance and
that together with (35) it implies that

$$\liminf_{N \to \infty} F_N \ge \inf_{\zeta} \mathcal{P}(\zeta). \tag{50}$$

This is again a consequence of the fundamental fact that we mentioned above, namely, that all the
relevant information about the Gibbs measure in the SK model is contained in the overlap matrix.
In the present context, it is not difficult to show that, due to the covariance structure of the Gaussian
processes (45) and (46), the quantities $A_N$ in (44) and $\mathcal{P}(\zeta)$ in (49) are, in fact, given by the same
continuous functional of the distribution of the overlap arrays $R^N$ and $R$ generated by i.i.d. samples
of replicas from $G'_N$ and $G$ correspondingly. As a result, if we consider a subsequence along which
the $\liminf_{N \to \infty} A_N$ is achieved and, at the same time, the array $R^N$ converges in distribution to some
array $R^\infty$, then the lower limit of $A_N$ can be written as the same functional of the distribution of
$R^\infty$. Finally, if we believe that the predictions of the physicists are correct, we can approximate $R^\infty$
in distribution by the overlap arrays $R$ generated by the Ruelle probability cascades and, therefore,
the lower limit is bounded from below by $\inf_\zeta \mathcal{P}(\zeta)$, which proves (50). The main difficulty in this
approach is to show that the Parisi ansatz for the Gibbs measure and the overlap array is, indeed,
correct in the infinite-volume limit, which will be discussed below.

Guerra's replica symmetry breaking bound. The fact that the Parisi formula also gives an
upper bound on the free energy,

$$\limsup_{N \to \infty} F_N \le \inf_{\zeta} \mathcal{P}(\zeta), \tag{51}$$

was proved in a breakthrough work of Guerra, [18]. The original argument in [18] was given in
the language of the recursive formula (12), but, as was observed in [0, it can also be written in
the language of the Ruelle probability cascades. The essence of Guerra's result is the following
interpolation between the SK model and the Ruelle probability cascades. Let $(z_i(h_\alpha))$ and $(y_i(h_\alpha))$
for $i \ge 1$ be independent copies of the processes $(z(h_\alpha))$ and $(y(h_\alpha))$ in (46) and, for $0 \le t \le 1$, let
us consider the Hamiltonian

$$H_{N,t}(\sigma, h_\alpha) = \sqrt{t} \, H_N(\sigma) + \sqrt{1-t} \sum_{i=1}^{N} z_i(h_\alpha) \sigma_i + \sqrt{t} \sum_{i=1}^{N} y_i(h_\alpha) \tag{52}$$

indexed by vectors $(\sigma, h_\alpha)$ such that $\sigma$ belongs to the support $\Sigma_N$ of the Gibbs measure $G_N$ and
$h_\alpha$ belongs to the support of the measure $G$ in (30). To this Hamiltonian one can associate the free
energy

$$\varphi(t) = \frac{1}{N} \mathbb{E} \log \sum_{\sigma \in \Sigma_N, \, \alpha \in \mathbb{N}^r} v_\alpha \exp(-\beta H_{N,t}(\sigma, h_\alpha)) \tag{53}$$

and, by a straightforward computation using the Gaussian integration by parts, one can check that
$\varphi'(t) \le 0$ and, therefore, $\varphi(1) \le \varphi(0)$. It is easy to see that

$$\varphi(0) = \frac{1}{N} \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} v_\alpha \prod_{i \le N} 2 \operatorname{ch}(-\beta z_i(h_\alpha))$$

and

$$\varphi(1) = F_N + \frac{1}{N} \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} v_\alpha \prod_{i \le N} \exp(-\beta y_i(h_\alpha)).$$

It is, again, a consequence of the properties of the Poisson processes involved in the construction
of the Ruelle probability cascades that, in fact, the independent copies for $i \le N$ can be decoupled
here and

$$\varphi(0) = \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} 2 \operatorname{ch}(-\beta z(h_\alpha)) v_\alpha$$

and

$$\varphi(1) = F_N + \mathbb{E} \log \sum_{\alpha \in \mathbb{N}^r} \exp(-\beta y(h_\alpha)) v_\alpha.$$

Recalling the representation (49), the inequality $\varphi(1) \le \varphi(0)$ can be written as $F_N \le \mathcal{P}(\zeta)$, which
yields the upper bound (51). After Guerra's discovery of the above interpolation argument,
Talagrand proved in his famous tour-de-force paper [42] that the Parisi formula, indeed, gives the
free energy in the SK model in the infinite-volume limit. Talagrand's ingenious proof finds a way
around the Parisi ansatz for the Gibbs measure, but it is rather involved. The Aizenman-Sims-Starr
scheme above gives a more natural approach if we are able to confirm the Parisi ansatz for the
asymptotic Gibbs measures. Moreover, the argument in [42] works only for mixed p-spin models
for even $p \ge 2$, while the above approach can be modified to yield the Parisi formula in the case
when odd p-spin interactions are present as well (see [30]). Nevertheless, to understand the impact
of the results of Guerra [18] and Talagrand [42] (proved in 2003), one only needs to remember that
a proof of the existence of the limit of the free energy by Guerra and Toninelli in [17] was quite an
impressive result only a year earlier.

3 Stability of the Gibbs measure 

The Ghirlanda-Guerra identities. Below we will explain an approach to proving the
predictions of the physicists for the Gibbs measure based on the so-called Ghirlanda-Guerra identities.
These identities were first discovered by Ghirlanda and Guerra in [16] in the setting of the mixed
p-spin models, where they were proved on average over the parameters $(\beta_p)$ in (4). However, the
general idea can be used in many other models if we utilize the mixed p-spin Hamiltonian in the
role of a perturbation. We will try to emphasize wide applicability of this idea by giving some mild
sufficient conditions that ensure the validity of these identities. For all $p \ge 1$, let us consider

$$g_p(\sigma) = \frac{1}{N^{p/2}} \sum_{i_1, \ldots, i_p = 1}^{N} g'_{i_1 \ldots i_p} \sigma_{i_1} \cdots \sigma_{i_p}, \tag{54}$$

where the random variables $(g'_{i_1 \ldots i_p})$ are i.i.d. standard Gaussian and independent of everything
else, and define

$$g(\sigma) = \sum_{p \ge 1} 2^{-p} x_p g_p(\sigma) \tag{55}$$

for some parameters $(x_p)_{p \ge 1}$ that belong to the interval $x_p \in [0, 3]$ for all $p \ge 1$. This Gaussian
process is of the same nature as the mixed p-spin Hamiltonian (4) except for a different normalization
in (54), which implies that the covariance is

$$\mathbb{E} g(\sigma^1) g(\sigma^2) = \sum_{p \ge 1} 4^{-p} x_p^2 R_{1,2}^p. \tag{56}$$

In other words, $g(\sigma)$ is of a smaller order than $H_N(\sigma)$ because of the additional factor $N^{-1/2}$. Let
us now consider a model with an arbitrary Hamiltonian $H(\sigma)$ on $\Sigma_N$, either random or non-random,
and consider the perturbed Hamiltonian

$$H^{\mathrm{pert}}(\sigma) = H(\sigma) + s g(\sigma), \tag{57}$$

for some parameter $s \ge 0$. What is the advantage of adding the perturbation term (55) to the original
Hamiltonian of the model? The answer to this question lies in the fact that, under certain conditions,
this perturbation term, in some sense, regularizes the Gibbs measure and forces it to satisfy useful
properties without affecting our main goal, the computation of the free energy. Using (56) and the
independence of $g(\sigma)$ and $H(\sigma)$, it is easy to see that

$$\frac{1}{N} \mathbb{E} \log \sum_{\sigma \in \Sigma_N} \exp H(\sigma) \le \frac{1}{N} \mathbb{E} \log \sum_{\sigma \in \Sigma_N} \exp\bigl( H(\sigma) + s g(\sigma) \bigr) \le \frac{1}{N} \mathbb{E} \log \sum_{\sigma \in \Sigma_N} \exp H(\sigma) + \frac{s^2}{2N} \sum_{p \ge 1} 4^{-p} x_p^2. \tag{58}$$

Both inequalities follow from Jensen's inequality applied either to the sum or the expectation with
respect to $g(\sigma)$ conditionally on $H(\sigma)$. This implies that if we let $s = s_N$ in (57) depend on $N$ in
such a way that

$$\lim_{N \to \infty} N^{-1} s_N^2 = 0, \tag{59}$$

then the limit of the free energy is unchanged by the perturbation term $s g(\sigma)$. On the other hand,
if $s = s_N$ is not too small then it turns out that the perturbation term has a non-trivial influence on
the Gibbs measure of the model. Consider a function

$$\varphi = \log \sum_{\sigma \in \Sigma_N} \exp\bigl( H(\sigma) + s g(\sigma) \bigr) \tag{60}$$

that will be viewed as a random function $\varphi = \varphi((x_p))$ of the parameters $(x_p)$, and suppose that

$$\sup \bigl\{ \mathbb{E} |\varphi - \mathbb{E} \varphi| : 0 \le x_p \le 3, \, p \ge 1 \bigr\} \le v_N(s) \tag{61}$$

for some function $v_N(s)$ that describes how well $\varphi((x_p))$ is concentrated around its expected value
uniformly over all possible choices of the parameters $(x_p)$ from the interval $[0, 3]$. The main condition
on the model will be expressed in terms of this concentration function, namely, that there exists
a sequence $s = s_N$ such that

$$\lim_{N \to \infty} s_N = \infty \quad \text{and} \quad \lim_{N \to \infty} s_N^{-2} v_N(s_N) = 0. \tag{62}$$

Of course, this condition will be useful only if the sequence $s_N$ also satisfies (59) and the
perturbation term does not affect the limit of the free energy. In the case of the mixed p-spin model,
$H(\sigma) = -\beta H_N(\sigma)$ for $H_N(\sigma)$ defined in (4), one can easily check using some standard Gaussian
concentration inequalities that (59) and (62) hold with the choice of $s_N = N^\gamma$ for any $1/4 < \gamma < 1/2$.
One can also check that this condition holds in other models, for example, random p-spin and K-sat 
models, or for any non-random Hamiltonian H(o). Now, let 

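$$G_N(\sigma) = \frac{\exp H^{\mathrm{pert}}(\sigma)}{Z_N}, \quad \text{where } Z_N = \sum_{\sigma\in\Sigma_N}\exp H^{\mathrm{pert}}(\sigma), \qquad (63)$$

be the Gibbs measure corresponding to the perturbed Hamiltonian (57) and let $\langle\,\cdot\,\rangle$ denote the
average with respect to $G_N^{\otimes\infty}$, that is, over all replicas. For any $n \ge 2$, $p \ge 1$ and any function $f$ of the overlaps
$(R_{l,l'})_{l,l'\le n}$ of $n$ replicas, let us define

$$\Delta(f,n,p) = \Bigl|\mathbb{E}\langle f R_{1,n+1}^p\rangle - \frac{1}{n}\mathbb{E}\langle f\rangle\,\mathbb{E}\langle R_{1,2}^p\rangle - \frac{1}{n}\sum_{l=2}^{n}\mathbb{E}\langle f R_{1,l}^p\rangle\Bigr|. \qquad (64)$$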



If we now think of the parameters $(x_p)_{p\ge 1}$ in (55) as a sequence of i.i.d. random variables with the
uniform distribution on $[1,2]$ and denote by $\mathbb{E}_x$ the expectation with respect to such a sequence then
(62) is a sufficient condition to guarantee that

$$\lim_{N\to\infty}\mathbb{E}_x\,\Delta(f,n,p) = 0. \qquad (65)$$

Once we have this statement on average, we can, of course, make a specific non-random choice of
parameters $(x_p^N)_{p\ge 1}$, which may vary with $N$, such that

$$\lim_{N\to\infty}\Delta(f,n,p) = 0. \qquad (66)$$
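The two requirements on the sequence $s_N$, namely (59) and (62), reduce to simple exponent arithmetic when $s_N = N^{\gamma}$. The sketch below assumes, hypothetically, that the concentration function $v_N(s_N)$ grows like a power of $N$, with default exponent $1/2$ as suggested by Gaussian concentration in the mixed $p$-spin case; this growth rate is an illustrative assumption, not a statement taken from the text.

```python
def perturbation_exponents(gamma, concentration_exponent=0.5):
    """Exponents of N in conditions (59) and (62) for s_N = N**gamma.

    Assumes v_N(s_N) is of order N**concentration_exponent (hypothetical
    default 1/2).
    """
    free_energy_exp = 2 * gamma - 1                           # N^{-1} s_N^2
    concentration_exp = concentration_exponent - 2 * gamma    # s_N^{-2} v_N(s_N)
    return free_energy_exp, concentration_exp


def is_admissible(gamma, concentration_exponent=0.5):
    """True when s_N -> infinity and both quantities in (59), (62) tend to zero."""
    a, b = perturbation_exponents(gamma, concentration_exponent)
    return gamma > 0 and a < 0 and b < 0
```

Under this assumption the admissible range is exactly $1/4 < \gamma < 1/2$: the lower bound comes from the concentration condition and the upper bound from the requirement that the free energy is unchanged.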

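In the thermodynamic limit, this can be expressed as a property of the asymptotic Gibbs measures.
Let us consider any subsequential limit of the distribution of the overlap matrix $R^N$ generated by
the perturbed Gibbs measure $G_N$ in (63) and let $G$ be the corresponding asymptotic Gibbs measure
on a Hilbert space defined via the Dovbysh-Sudakov representation. If we still denote by $\langle\,\cdot\,\rangle$ the
average with respect to $G^{\otimes\infty}$ then (66) implies the Ghirlanda-Guerra identities,

$$\mathbb{E}\langle f R_{1,n+1}^p\rangle = \frac{1}{n}\mathbb{E}\langle f\rangle\,\mathbb{E}\langle R_{1,2}^p\rangle + \frac{1}{n}\sum_{l=2}^{n}\mathbb{E}\langle f R_{1,l}^p\rangle \qquad (67)$$

for any $n \ge 2$, $p \ge 1$ and any function $f$ of the overlaps $(R_{l,l'})_{l,l'\le n} = (\sigma^l\cdot\sigma^{l'})_{l,l'\le n}$ of $n$ replicas
sampled from $G$. Of course, since any bounded measurable function $\psi$ can be approximated by
polynomials (in the $L^1$ sense), this implies that

$$\mathbb{E}\langle f \psi(R_{1,n+1})\rangle = \frac{1}{n}\mathbb{E}\langle f\rangle\,\mathbb{E}\langle\psi(R_{1,2})\rangle + \frac{1}{n}\sum_{l=2}^{n}\mathbb{E}\langle f \psi(R_{1,l})\rangle. \qquad (68)$$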

If $\zeta$ is the distribution of one overlap, as in (19), then (68) can be expressed by saying that,
conditionally on $(R_{l,l'})_{l,l'\le n}$, the distribution of $R_{1,n+1}$ is given by the mixture

$$\frac{1}{n}\zeta + \frac{1}{n}\sum_{l=2}^{n}\delta_{R_{1,l}}.$$

These identities already appear in the Parisi replica method where they arise as a consequence of
"replica equivalence", but Ghirlanda and Guerra gave the first mathematical proof using the
self-averaging of the free energy, which is what the condition (62) basically means. The self-averaging
of the free energy in the Sherrington-Kirkpatrick model was first proved by Pastur and Shcherbina
in [35]. The identities (67) might look mysterious, but, in fact, they are just a manifestation of the
general principle of the concentration of a Hamiltonian, in this case

$$\lim_{N\to\infty}\mathbb{E}_x\,\mathbb{E}\bigl\langle\bigl|g_p(\sigma) - \mathbb{E}\langle g_p(\sigma)\rangle\bigr|\bigr\rangle = 0, \qquad (69)$$

which can be proved using the self-averaging of the free energy condition (62). The way (69)
implies the Ghirlanda-Guerra identities is very simple: essentially, one tests this concentration on
a test function. If we fix $n \ge 2$ and consider a bounded function $f = f((R_{l,l'})_{l,l'\le n})$ of the overlaps
of $n$ replicas then

$$\bigl|\mathbb{E}\langle f g_p(\sigma^1)\rangle - \mathbb{E}\langle f\rangle\,\mathbb{E}\langle g_p(\sigma)\rangle\bigr| \le \|f\|_\infty\,\mathbb{E}\bigl\langle\bigl|g_p(\sigma) - \mathbb{E}\langle g_p(\sigma)\rangle\bigr|\bigr\rangle \qquad (70)$$

and (69) implies that

$$\lim_{N\to\infty}\mathbb{E}_x\bigl|\mathbb{E}\langle f g_p(\sigma^1)\rangle - \mathbb{E}\langle f\rangle\,\mathbb{E}\langle g_p(\sigma)\rangle\bigr| = 0. \qquad (71)$$

This is precisely the equation (65) after we use the Gaussian integration by parts.
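The mixture form of the Ghirlanda-Guerra identities, in which the conditional law of $R_{1,n+1}$ given the overlaps of the first $n$ replicas is $\frac{1}{n}\zeta + \frac{1}{n}\sum_{l=2}^{n}\delta_{R_{1,l}}$, can be made concrete when $\zeta$ has finitely many atoms. The following is a minimal sketch under that assumption; the function name and the dictionary representation of $\zeta$ are illustrative choices.

```python
from fractions import Fraction


def conditional_overlap_law(zeta, known_overlaps):
    """Law of R_{1,n+1} given R_{1,2}, ..., R_{1,n} under the mixture rule.

    zeta: dict mapping an overlap value to its probability under zeta.
    known_overlaps: list [R_{1,2}, ..., R_{1,n}], of length n - 1.
    Returns a dict mapping an overlap value to its conditional probability.
    """
    n = len(known_overlaps) + 1
    # Part (1/n) * zeta: a fresh overlap sampled from zeta.
    law = {q: Fraction(w) / n for q, w in zeta.items()}
    # Part (1/n) * sum of point masses at the already observed overlaps.
    for r in known_overlaps:
        law[r] = law.get(r, Fraction(0)) + Fraction(1, n)
    return law
```

With probability $1/n$ the new overlap is drawn afresh from $\zeta$, and with probability $1/n$ each it repeats one of the overlaps $R_{1,2}, \ldots, R_{1,n}$ already seen.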



The Aizenman-Contucci stochastic stability. Another famous property of the Gibbs measure,
the so-called stochastic stability discovered by Aizenman and Contucci in [1], is also a
consequence of (69). A proof can be found in [43] (see also [11]) and a rigorous justification of how to
extend the stochastic stability to the setting of the asymptotic Gibbs measures can be found in [4].
To state this property, let us assume, for simplicity, that the asymptotic Gibbs measure $G$ is atomic,
$G(h_l) = v_l$, with the weights arranged in non-increasing order, $v_1 \ge v_2 \ge \ldots$. Given integer $p \ge 1$,
let $(g_p(h_l))_{l\ge 1}$ be a Gaussian sequence conditionally on $G$ indexed by the points $(h_l)_{l\ge 1}$ with the
covariance

$$\mathbb{E}\, g_p(h_l) g_p(h_{l'}) = (h_l\cdot h_{l'})^p, \qquad (72)$$

which is reminiscent of (48), only now we do not know a priori that the support $\{h_l : l \ge 1\}$ is
ultrametric in $H$. Given $t \in \mathbb{R}$, consider a new measure

$$G_t(h_l) = v_l^t = \frac{v_l \exp t g_p(h_l)}{\sum_{j\ge 1} v_j \exp t g_p(h_j)} \qquad (73)$$

defined by the random change of density proportional to $\exp t g_p(h_l)$. Then the Aizenman-Contucci
stochastic stability, basically, states that this new measure generates the same overlap array in
distribution as the original measure $G$. One way to express this property is as follows. Let $\pi : \mathbb{N} \to \mathbb{N}$
be a permutation such that the weights $(v_{\pi(l)}^t)$ are also arranged in non-increasing order. Then,

$$\bigl((v_{\pi(l)}^t)_{l\ge 1}, (h_{\pi(l)}\cdot h_{\pi(l')})_{l,l'\ge 1}\bigr) \stackrel{d}{=} \bigl((v_l)_{l\ge 1}, (h_l\cdot h_{l'})_{l,l'\ge 1}\bigr) \qquad (74)$$

for any $p \ge 1$ and $t \in \mathbb{R}$, which, clearly, implies that the overlap arrays generated by these
measures will have the same distribution. It is not difficult to see that, in fact, the two statements are
equivalent.
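The change of density (73) and the relabeling by the permutation $\pi$ are easy to carry out for a measure with finitely many atoms. The sketch below is only an illustration under that truncation: the values $g_p(h_l)$ are supplied by the caller rather than sampled with the covariance (72), and all function names are hypothetical.

```python
import math


def reweight(weights, g_values, t):
    """Weights v_l^t from (73): v_l * exp(t * g_l), normalized to sum to one."""
    raw = [v * math.exp(t * g) for v, g in zip(weights, g_values)]
    total = sum(raw)
    return [r / total for r in raw]


def sorting_permutation(new_weights):
    """Indices pi(0), pi(1), ... listing atoms in non-increasing order of weight."""
    return sorted(range(len(new_weights)), key=lambda l: -new_weights[l])


def permuted_overlaps(gram, perm):
    """Overlap matrix (h_pi(l) . h_pi(l')) obtained by relabeling the atoms."""
    return [[gram[a][b] for b in perm] for a in perm]
```

Stochastic stability is a statement about the joint distribution of the output of `reweight` (after sorting) and `permuted_overlaps` when the Gaussian values are random; the code only shows the deterministic bookkeeping for one realization.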
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Unified stability property. Even though the proof of the Parisi ansatz that will be described 
in the next section is based only on the Ghirlanda-Guerra identities, the reason we mention the 
Aizenman-Contucci stochastic stability (74) is because, in a number of ways, it played a very
important role in the development of the area. In particular, one of the main ideas behind the
proof of the Parisi ansatz was first discovered using a unified stability property, proved in [28],
that combines the Ghirlanda-Guerra identities (67) and the Aizenman-Contucci stochastic stability
(74). It can be stated with the notation in (74) as follows. It is well known (we will discuss this
again below) that if the measure $G$ satisfies the Ghirlanda-Guerra identities and if $c^2$ is the largest
point of the support of the distribution of the overlap $R_{1,2}$ under $\mathbb{E}G^{\otimes 2}$ then, with probability one,
$G$ is concentrated on the sphere of radius $c$. Let

$$b_p = (c^2)^p - \mathbb{E}\langle R_{1,2}^p\rangle. \qquad (75)$$

Then, a random measure $G$ satisfies the Ghirlanda-Guerra identities (67) and the Aizenman-Contucci
stochastic stability (74) if and only if it is concentrated on the sphere of constant radius, say $c$, and
for any $p \ge 1$ and $t \in \mathbb{R}$,

$$\bigl((v_{\pi(l)}^t)_{l\ge 1}, (g_p(h_{\pi(l)}) - b_p t)_{l\ge 1}, (h_{\pi(l)}\cdot h_{\pi(l')})_{l,l'\ge 1}\bigr) \stackrel{d}{=} \bigl((v_l)_{l\ge 1}, (g_p(h_l))_{l\ge 1}, (h_l\cdot h_{l'})_{l,l'\ge 1}\bigr). \qquad (76)$$

Comparing with (74), we see that the Ghirlanda-Guerra identities are now replaced by the
statement that, after the permutation $\pi$ which rearranges the weights in (73) in the decreasing order,
the distribution of the Gaussian process $(g_p(h_l))$ will only be affected by a constant shift $b_p t$.
Interestingly, the unified stability property (76) was known for some time in the setting of the
Ruelle probability cascades, where it was proved by Bolthausen and Sznitman in [5] using
properties of the Poisson processes in the construction of the Ruelle probability cascades. However, the
Ghirlanda-Guerra identities for the RPC were originally proved by Talagrand [39] and Bovier and
Kurkova [6] by analyzing the Gibbs measure in the Derrida REM and GREM, and it was only later
noticed by Talagrand that they follow much more easily from the Bolthausen-Sznitman invariance.
The main result of [28] stated in (76), basically, reverses Talagrand's observation.

4 Structure of the Gibbs measure 

It was clear since they were discovered that the stability properties impose strong constraints on
the structure of the Gibbs measure, but the question was whether they lead all the way to the Ruelle
probability cascades. The first partial answer to this question was given in an influential work of
Arguin and Aizenman, Q, who proved that, under a technical assumption that the overlap takes 
only finitely many values in the thermodynamic limit, $R_{1,2} \in \{q_0, \ldots, q_r\}$, the Aizenman-Contucci
stochastic stability (74) implies the ultrametricity predicted by the Parisi ansatz. Soon after, it
was shown in [25] under the same technical assumption that the Ghirlanda-Guerra identities also
imply ultrametricity (an elementary proof can be found in [27]). Another approach was given by
Talagrand in [43]. However, since at low temperature the overlap does not necessarily take finitely
many values in the thermodynamic limit, all these results were not directly applicable to the SK
and mixed $p$-spin models. Nevertheless, they strongly suggested that the stability properties can
explain the Parisi ansatz and, indeed, it was recently proved in [29] that the Ghirlanda-Guerra
identities imply ultrametricity (21) without any technical assumptions. Before we explain some
of the main ideas in the proof, let us first describe several preliminary facts that follow from the
Ghirlanda-Guerra identities.

Pure states. First of all, one can show very easily (see [25]) that the Ghirlanda-Guerra identities
(67) yield the Parisi pure states picture described above. Namely, if $c^2$ is the largest point in the
support of the distribution $\zeta$ of the overlap $R_{1,2}$ under $\mathbb{E}G^{\otimes 2}$ defined in (19) then, with probability
one, $G(\|h\| = c) = 1$. Moreover, the measure $G$ is purely atomic if $\zeta(\{c^2\}) > 0$, otherwise, it has
no atoms. Of course, it is clear that Parisi's pure states picture in [34] was meant to be understood
in an approximate sense and, when $\zeta(\{c^2\}) = 0$ and $G$ has no atoms, we can create pure states using
ultrametricity by considering equivalence clusters in (22) for small positive diameter $d > 0$. In the
case when $\zeta(\{c^2\}) > 0$, the pure states picture holds not only for the asymptotic Gibbs measures
in the infinite-volume limit, but also for the original Gibbs measures $G_N$ for finite size systems in
some approximate sense, as was shown by Talagrand in [43].

Talagrand's positivity principle. Another important consequence of the Ghirlanda-Guerra
identities is the so-called Talagrand positivity principle, proved in [39], which states that the overlaps
can take only non-negative values in the thermodynamic limit, so $\sigma^1\cdot\sigma^2 \ge 0$ for any two points in
the support of $G$. In the Parisi replica method, the overlap was always assumed to be non-negative
due to the symmetry breaking, and we see that this, indeed, can be obtained using a small
perturbation of the Hamiltonian which ensures the validity of the Ghirlanda-Guerra identities. One key
application of the positivity principle is to show that Guerra's interpolation argument leading to the 
upper bound (ISTT) also works for mixed p-spin models that include the pure p-spin Hamiltonians 
for odd $p \ge 3$, which was observed by Talagrand in [40].

Characterizing asymptotic Gibbs' measures. Finally, let us mention a fact that has been well
known since the discovery of the Ghirlanda-Guerra identities, namely, that together with
ultrametricity these identities determine the distribution of the entire overlap array uniquely in terms
of the functional order parameter $\zeta$ and, moreover, one can approximate the overlap array in
distribution by an overlap array generated by some Ruelle probability cascades. In other words, as
soon as we have ultrametricity, all the predictions of the physicists are confirmed. The idea here
is very straightforward and we will only illustrate it in the simplest case when the overlaps take
finitely many values, $R_{1,2} \in \{q_0, \ldots, q_r\}$. The general case easily follows by approximation. In the
discrete case, we only need to demonstrate that, using ultrametricity and the Ghirlanda-Guerra
identities, we can compute in terms of $\zeta$ the probability of any particular configuration of finitely
many overlaps,

$$\mathbb{E}\bigl\langle I\bigl(R_{l,l'} = q_{l,l'} : l \ne l' \le n+1\bigr)\bigr\rangle, \qquad (77)$$

for any $n \ge 1$ and any $q_{l,l'} \in \{q_0, \ldots, q_r\}$. Let us find the largest elements among $q_{l,l'}$ for $l \ne l'$ and,
without loss of generality, suppose that $q_{1,n+1}$ is one of them. We only have to consider $(q_{l,l'})$ that
are ultrametric, since, otherwise, (77) is equal to zero. In particular, since $q_{1,n+1}$ is the largest, for
$2 \le l \le n$,

$$q_{1,l} \ge \min(q_{1,n+1}, q_{l,n+1}) = q_{l,n+1}$$

and

$$q_{l,n+1} \ge \min(q_{1,n+1}, q_{1,l}) = q_{1,l},$$

which implies that $q_{1,l} = q_{l,n+1}$. Hence, if the overlap $R_{1,n+1} = q_{1,n+1}$ then, for all $2 \le l \le n$,
$R_{1,l} = q_{1,l}$ automatically implies that $R_{l,n+1} = q_{1,l}$ and (77) equals

$$\mathbb{E}\bigl\langle I\bigl(R_{l,l'} = q_{l,l'} : l,l' \le n\bigr)\, I\bigl(R_{1,n+1} = q_{1,n+1}\bigr)\bigr\rangle. \qquad (78)$$

In other words, if we know that the replicas $\sigma^1$ and $\sigma^{n+1}$ are the closest then, due to ultrametricity,
all the conditions $R_{l,n+1} = q_{l,n+1} = q_{1,l}$ become redundant and we can omit them. The quantity
(78) is now of the same type as the left hand side of (68) and, therefore, the Ghirlanda-Guerra
identities imply that it is equal to

$$\frac{1}{n}\zeta(\{q_{1,n+1}\})\,\mathbb{E}\bigl\langle I\bigl(R_{l,l'} = q_{l,l'} : l,l' \le n\bigr)\bigr\rangle + \frac{1}{n}\sum_{l=2}^{n} I\bigl(q_{1,l} = q_{1,n+1}\bigr)\,\mathbb{E}\bigl\langle I\bigl(R_{l,l'} = q_{l,l'} : l,l' \le n\bigr)\bigr\rangle. \qquad (79)$$

We can continue this computation recursively over $n$ and, in the end, (77) will be expressed
completely in terms of the distribution of one overlap, $\zeta$. To conclude that the overlap array can actually
be generated by the Ruelle probability cascades corresponding to the functional order parameter
$\zeta$, we only need to recall that both properties, the Ghirlanda-Guerra identities and ultrametricity,
are satisfied by the RPC, so all the probabilities (77) will be given by the same computation.
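The restriction to ultrametric configurations $(q_{l,l'})$ used in this computation is a finite check on the matrix of prescribed overlaps. A minimal sketch of that check follows; the matrix is assumed symmetric, its diagonal entries are never inspected, and the function name is illustrative.

```python
from itertools import permutations


def is_ultrametric(q):
    """Check q[a][b] >= min(q[a][c], q[b][c]) for all distinct a, b, c.

    q is a symmetric matrix of overlaps; larger overlap means closer points,
    so this is the ultrametric inequality written in terms of overlaps.
    """
    n = len(q)
    return all(
        q[a][b] >= min(q[a][c], q[b][c])
        for a, b, c in permutations(range(n), 3)
    )
```

For an ultrametric matrix in which the entry between the first and last replica is a largest off-diagonal entry, the equalities $q_{1,l} = q_{l,n+1}$ derived above can be read off directly from the matrix.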



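Ultrametricity. It remains to explain the main result in [29] which shows that ultrametricity is
also a consequence of the Ghirlanda-Guerra identities. The main idea of the proof is the following
invariance property. Given $n \ge 1$, we consider $n$ bounded measurable functions $f_1, \ldots, f_n : \mathbb{R} \to \mathbb{R}$
and let

$$F(\sigma, \sigma^1, \ldots, \sigma^n) = f_1(\sigma\cdot\sigma^1) + \ldots + f_n(\sigma\cdot\sigma^n). \qquad (80)$$

For $1 \le l \le n$, we define

$$F_l(\sigma, \sigma^1, \ldots, \sigma^n) = F(\sigma, \sigma^1, \ldots, \sigma^n) - f_l(\sigma\cdot\sigma^l) + \mathbb{E}\langle f_l(R_{1,2})\rangle, \qquad (81)$$

where, as before, $\langle\,\cdot\,\rangle$ denotes the average with respect to $G^{\otimes\infty}$. Then, for any bounded measurable
function $\Phi$ of the overlaps $(R_{l,l'})_{l,l'\le n}$ of $n$ replicas,

$$\mathbb{E}\langle\Phi\rangle = \mathbb{E}\Bigl\langle\Phi\,\frac{\exp\sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle\exp F(\sigma, \sigma^1, \ldots, \sigma^n)\rangle_{-}^{n}}\Bigr\rangle, \qquad (82)$$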

where $\langle\,\cdot\,\rangle_{-}$ in the denominator is the average in $\sigma$ for fixed $\sigma^1, \ldots, \sigma^n$ with respect to the measure
$G$. One can think of the ratio on the right hand side as a change of density that does not affect the
distribution of the overlaps of $n$ replicas. Originally, this invariance property was discovered using
the unified stability property (76), so the Aizenman-Contucci stochastic stability played an equally
important role. However, the proof presented in [29] is much simpler and more straightforward,
and is based only on the Ghirlanda-Guerra identities. In some sense, this is good news because
the Aizenman-Contucci stability is a more subtle property to work with than the Ghirlanda-Guerra
identities, especially in the thermodynamic limit. To prove (82), one can consider an interpolating
function

$$\varphi(t) = \mathbb{E}\Bigl\langle\Phi\,\frac{\exp\sum_{l=1}^{n} t F_l(\sigma^l, \sigma^1, \ldots, \sigma^n)}{\langle\exp t F(\sigma, \sigma^1, \ldots, \sigma^n)\rangle_{-}^{n}}\Bigr\rangle \qquad (83)$$

and, using an elementary calculation, check that the Ghirlanda-Guerra identities imply that all
the derivatives vanish at zero, $\varphi^{(k)}(0) = 0$. Taylor's expansion and some basic estimates of the
derivatives yield that this function is constant for all $t \ge 0$, proving that $\varphi(0) = \varphi(1)$, which is
precisely (82). A special feature of the invariance property (82) is that it contains some very useful
information not only about the overlaps but also about the Gibbs weights of the neighborhoods of
the replicas $\sigma^1, \ldots, \sigma^n$. Let us give one simple example. Recall that the measure $G$ is concentrated
on the sphere $\|h\| = c$ and, for $q = c^2 - \varepsilon$, let $f_1(x) = t I(x \ge q)$ and $f_2 = \ldots = f_n = 0$. Then

$$F(\sigma, \sigma^1, \ldots, \sigma^n) = t I(\sigma\cdot\sigma^1 \ge q)$$

is a scaled indicator of a small neighborhood of $\sigma^1$ on the sphere $\|h\| = c$. If we denote by $W_1 =
G(\sigma : \sigma\cdot\sigma^1 \ge q)$ the Gibbs weight of this neighborhood then the average in the denominator in
(82) is equal to

$$\langle\exp F(\sigma, \sigma^1, \ldots, \sigma^n)\rangle_{-} = W_1 e^t + 1 - W_1.$$
Suppose now that the function $\Phi = I_A$ is an indicator of the event

$$A = \bigl\{(\sigma^1, \ldots, \sigma^n) : \sigma^1\cdot\sigma^l < q \text{ for } 2 \le l \le n\bigr\}$$

that the replicas $\sigma^2, \ldots, \sigma^n$ are outside of this neighborhood of $\sigma^1$. Then, it is easy to see that

$$\sum_{l=1}^{n} F_l(\sigma^l, \sigma^1, \ldots, \sigma^n) = t\,\mathbb{E}\langle I(R_{1,2} \ge q)\rangle =: t\gamma$$

and (82) becomes

$$\mathbb{E}\langle I_A\rangle = \mathbb{E}\Bigl\langle I_A\,\frac{e^{t\gamma}}{(W_1 e^t + 1 - W_1)^n}\Bigr\rangle, \qquad (84)$$

which may be viewed as a condition on the weight $W_1$ and the event $A$. This is just one
artificial example, but the idea can be pushed much further and with some work one can obtain some
very useful consequences about the structure of the measure $G$. One of these consequences is the
following "duplication property".
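The formula $W_1 e^t + 1 - W_1$ for the denominator in this example can be verified directly for a measure with finitely many atoms. The sketch below assumes such a discrete measure, described by its weights and by the overlaps of its atoms with $\sigma^1$; the function names are illustrative.

```python
import math


def neighborhood_weight(weights, overlaps_with_sigma1, q):
    """W_1 = G(sigma : sigma . sigma^1 >= q) for a discrete measure G."""
    return sum(w for w, r in zip(weights, overlaps_with_sigma1) if r >= q)


def denominator_direct(weights, overlaps_with_sigma1, q, t):
    """Average of exp(F) over sigma, atom by atom, with F = t * I(sigma . sigma^1 >= q)."""
    return sum(
        w * math.exp(t if r >= q else 0.0)
        for w, r in zip(weights, overlaps_with_sigma1)
    )


def density_factor(w1, gamma, t, n):
    """The ratio exp(t * gamma) / (W_1 e^t + 1 - W_1)^n appearing in (84)."""
    return math.exp(t * gamma) / (w1 * math.exp(t) + 1.0 - w1) ** n
```

Atoms inside the neighborhood contribute $e^t$ times their weight and atoms outside contribute their weight unchanged, which is exactly the closed form above; at $t = 0$ the density factor equals one.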

Suppose that with positive probability over the choice of the measure $G$ we can sample $n$
replicas $\sigma^1, \ldots, \sigma^n$ from $G$ that are approximately at certain fixed distances from each other. Of
course, since all replicas live on the same sphere, this can be expressed in terms of the overlaps,
$R_{l,l'} \approx a_{l,l'}$, for some $n \times n$ matrix of constraints $A = (a_{l,l'})$. Let

$$a_n^* = \max(a_{1,n}, \ldots, a_{n-1,n})$$

be the constraint corresponding to the closest point among $\sigma^1, \ldots, \sigma^{n-1}$ to the last replica $\sigma^n$ and
suppose that the distance between them is strictly positive, $a_n^* < c^2$. Then, using the invariance
property (82), one can show that with positive probability over the choice of $G$ one can sample
$n+1$ replicas $\sigma^1, \ldots, \sigma^{n+1}$ from $G$ such that the distances between the first $n$ replicas are as above,
$R_{l,l'} \approx a_{l,l'}$, and the new replica $\sigma^{n+1}$ duplicates $\sigma^n$ in the following sense (see Fig. 2). First of all,
it is approximately at the same distances from the replicas $\sigma^1, \ldots, \sigma^{n-1}$ as $\sigma^n$,

$$R_{l,n+1} \approx a_{l,n} \quad \text{for } 1 \le l \le n-1,$$

and, moreover, it is at least as far from $\sigma^n$ as the other closest replica, $R_{n,n+1} \lesssim a_n^*$. The motivation
for this property becomes clear if we recall how the support of the Ruelle probability cascades
was constructed in (29). In that case it is obvious that one can always duplicate any replica with
probability one, and not just in the weak sense described here. On the other hand, even this weak
duplication property implies that the support of the measure $G$ is ultrametric with probability one.

Figure 2: Duplication property. The grey area corresponds to all the points on the sphere $\|h\| = c$
which are approximately at the same distance from the first $n-1$ replicas $\sigma^1, \ldots, \sigma^{n-1}$ as the
last replica $\sigma^n$. Then the white point is a duplicate $\sigma^{n+1}$ of $\sigma^n$. It is in the grey area, so it is
approximately at the same distances from the replicas $\sigma^1, \ldots, \sigma^{n-1}$ as $\sigma^n$, and it is at least as far
from $\sigma^n$ as the closest of the first $n-1$ replicas, in this case $\sigma^{n-1}$.

To see this, suppose that ultrametricity is violated and with positive probability three replicas
can take values

$$R_{1,2} \approx x, \quad R_{1,3} \approx y \quad \text{and} \quad R_{2,3} \approx z \qquad (85)$$

for some constraints $x < y \le z < c^2$. Let us duplicate each replica in the above sense $m-1$ times,
so that at the end we will have $n = 3m$ replicas. Suppose that

$$\{1, \ldots, n\} = I_1 \cup I_2 \cup I_3$$

is the partition such that $j \in I_j$ for $j \le 3$, $|I_j| = m$ and each $I_j \setminus \{j\}$ is precisely the index set of
duplicates of $\sigma^j$. Then, it should be almost obvious that

(a) $R_{l,l'} \lesssim z$ for all $1 \le l \ne l' \le n$,

(b) $R_{l,l'} \approx x$ if $l \in I_1, l' \in I_2$, $R_{l,l'} \approx y$ if $l \in I_1, l' \in I_3$ and $R_{l,l'} \approx z$ if $l \in I_2, l' \in I_3$.

Property (a) holds because a new replica never gets too close to the old replicas and the overlap
will never exceed $z$, and property (b) holds because, every time we duplicate a point, the distances
to all other points will be the same, so the overlaps between points in different groups $I_1, I_2, I_3$ will
always be the same as the original constraints in (85). This means that with positive probability we
can find replicas $\sigma^1, \ldots, \sigma^n$ on the sphere of radius $c$ such that their overlaps satisfy properties (a)
and (b). Let $\bar\sigma^j$ be the barycenter of the set $\{\sigma^l : l \in I_j\}$. The condition (a) implies that

$$\|\bar\sigma^j\|^2 = \frac{1}{m^2}\sum_{l\in I_j}\|\sigma^l\|^2 + \frac{1}{m^2}\sum_{l\ne l'\in I_j} R_{l,l'} \lesssim \frac{mc^2 + m(m-1)z}{m^2},$$

and the condition (b) implies that $\bar\sigma^1\cdot\bar\sigma^2 \approx x$, $\bar\sigma^1\cdot\bar\sigma^3 \approx y$ and $\bar\sigma^2\cdot\bar\sigma^3 \approx z$. Hence,

$$\|\bar\sigma^2 - \bar\sigma^3\|^2 = \|\bar\sigma^2\|^2 + \|\bar\sigma^3\|^2 - 2\bar\sigma^2\cdot\bar\sigma^3 \lesssim \frac{2(c^2 - z)}{m}$$

and $0 < y - x \approx \bar\sigma^1\cdot\bar\sigma^3 - \bar\sigma^1\cdot\bar\sigma^2 \le K m^{-1/2}$. We arrive at a contradiction by letting $m \to \infty$. The
conclusion is that if the measure $G$ satisfies the Ghirlanda-Guerra identities then its support must
be ultrametric with probability one. Therefore, as was explained above, under a small perturbation
of the Hamiltonian that yields the Ghirlanda-Guerra identities, all possible limits of the Gibbs
measure can be identified with the Ruelle probability cascades. Moreover, we will see below that
for "generic" mixed $p$-spin models no perturbation is necessary and the limit of the Gibbs measure
is unique.
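The arithmetic behind the contradiction in the barycenter argument can be made explicit. The sketch below treats the approximate relations as exact and takes the constant to be $K = c\sqrt{2(c^2 - z)}$, which is what the Cauchy-Schwarz inequality with $\|\bar\sigma^1\| \le c$ gives; this particular value of $K$ and the function names are assumptions made for illustration.

```python
import math


def barycenter_norm_bound(c2, z, m):
    """Upper bound (m c^2 + m(m-1) z) / m^2 on the squared norm of a barycenter."""
    return (m * c2 + m * (m - 1) * z) / m ** 2


def gap_bound(c2, z, m):
    """Bound c * sqrt(2 (c^2 - z) / m) on the difference of barycenter overlaps."""
    return math.sqrt(c2) * math.sqrt(2.0 * (c2 - z) / m)


def duplications_needed(c2, x, y, z):
    """Smallest group size m for which the bound drops strictly below y - x."""
    m = 1
    while gap_bound(c2, z, m) >= y - x:
        m += 1
    return m
```

For instance, with $c^2 = 1$, $x = 0.2$ and $y = z = 0.5$ the bound is $m^{-1/2}$, so once each group has 12 points the inequality $y - x \le K m^{-1/2}$ can no longer hold.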

5 Consequences of the Parisi formula 

Once we know the Parisi formula for the free energy in the mixed p-spin models, some results can 
be extended and strengthened. 

Universality in the disorder. First of all, one can prove the universality in the disorder and show
that the Parisi formula (14) still holds if the Gaussian random variables $(g_{ij})$ in the Hamiltonian
(1) are replaced by i.i.d. random variables $(x_{ij})$ from any other distribution, as long as

$$\mathbb{E}x_{11} = 0, \quad \mathbb{E}x_{11}^2 = 1 \quad \text{and} \quad \mathbb{E}|x_{11}|^3 < \infty. \qquad (86)$$

This was proved by Carmona and Hu in [7], who generalized an earlier result of Talagrand [38] in
the case of the Bernoulli disorder. The proof is based on the following interpolation between the
two Hamiltonians for $0 \le t \le 1$,

$$H_{N,t}(\sigma) = \frac{1}{\sqrt{N}}\sum_{i,j=1}^{N}\bigl(\sqrt{t}\,x_{ij} + \sqrt{1-t}\,g_{ij}\bigr)\sigma_i\sigma_j, \qquad (87)$$

and the estimates of the derivative of the free energy along this interpolation using some approxi- 
mate integration by parts formulas. The same result was also proved in for the p-spin model. 
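The interpolating Hamiltonian (87) is straightforward to evaluate for a given realization of the disorder. The sketch below uses Bernoulli couplings as one example of a non-Gaussian distribution satisfying (86); the helper `sample_disorder` and its seed are hypothetical conveniences, not part of the original argument.

```python
import math
import random


def interpolated_hamiltonian(sigma, x, g, t):
    """H_{N,t}(sigma) from (87) for coupling matrices x (general) and g (Gaussian)."""
    n = len(sigma)
    a, b = math.sqrt(t), math.sqrt(1.0 - t)
    total = 0.0
    for i in range(n):
        for j in range(n):
            total += (a * x[i][j] + b * g[i][j]) * sigma[i] * sigma[j]
    return total / math.sqrt(n)


def sample_disorder(n, seed=0):
    """Hypothetical helper: Bernoulli (+1/-1) couplings x and Gaussian couplings g."""
    rng = random.Random(seed)
    x = [[rng.choice((-1.0, 1.0)) for _ in range(n)] for _ in range(n)]
    g = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(n)]
    return x, g
```

At $t = 0$ the function returns the Gaussian Hamiltonian and at $t = 1$ the one with the replaced disorder; for every $t$ in between each coupling $\sqrt{t}\,x_{ij} + \sqrt{1-t}\,g_{ij}$ still has mean zero and variance $t + (1-t) = 1$.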






Generic mixed p-spin models. In another direction, we can say more about the thermodynamic
limit in the case of the so-called generic mixed $p$-spin models whose Hamiltonian (4) contains
sufficiently many pure $p$-spin terms (5), so that the following condition is satisfied:

(G) the linear span of constants and power functions $x^p$ corresponding to $\beta_p \ne 0$ is dense in
$(C[-1,1], \|\cdot\|_\infty)$.

The reason for this is that each pure $p$-spin term contains some information about the $p$th moment
of the overlap and the condition (G) allows us to confirm the Parisi ansatz for the Gibbs measure
in the thermodynamic limit without the help of the perturbation term in (55), so the result becomes
cleaner, in some sense. Moreover, in this case we can show that the asymptotic Gibbs measure
is unique. This can be seen in two steps. Let us denote by $\mathcal{P} = \mathcal{P}((\beta_p))$ the infimum on the
right-hand side of (14) and let $\mathcal{M}$ be the set of all limits over subsequences of the distribution of
the overlap $R_{1,2}$ of two spin configurations sampled from the Gibbs measure $G_N$ corresponding to
the Hamiltonian (4). It was proved by Talagrand in [41] that, for each $p \ge 1$, the Parisi formula is
differentiable with respect to $\beta_p$ and

$$\frac{\partial\mathcal{P}}{\partial\beta_p} = \beta_p\Bigl(1 - \int q^p\, d\zeta(q)\Bigr) \qquad (88)$$

for all $\zeta \in \mathcal{M}$. If $\beta_p \ne 0$, then this implies that all the limits $\zeta \in \mathcal{M}$ have the same $p$th moment
and the condition (G) then implies that $\mathcal{M} = \{\zeta_0\}$ for some unique distribution $\zeta_0$ on $[-1,1]$. As a
second step, one can prove the convergence of the entire overlap array $(R_{l,l'})_{l,l'\ge 1}$ in distribution as
follows. As a consequence of the differentiability of the Parisi formula, it was proved in [26] that,
whenever $\beta_p \ne 0$, the Ghirlanda-Guerra identities

$$\mathbb{E}\langle f R_{1,n+1}^p\rangle = \frac{1}{n}\mathbb{E}\langle f\rangle\,\mathbb{E}\langle R_{1,2}^p\rangle + \frac{1}{n}\sum_{l=2}^{n}\mathbb{E}\langle f R_{1,l}^p\rangle \qquad (89)$$

for the $p$th moment of the overlap hold in the thermodynamic limit in a strong sense, for the
Gibbs measure $G_N$ corresponding to the original Hamiltonian (4) without the perturbation term
(55). The condition (G) then, obviously, implies that the general Ghirlanda-Guerra identities (68)
also hold. As we explained above, the Ghirlanda-Guerra identities imply ultrametricity and, as a
result, the distribution of the entire overlap array can be uniquely determined by the distribution of
one overlap. Since this distribution $\zeta_0$ is unique, the distribution of the entire overlap array under
$\mathbb{E}G_N^{\otimes\infty}$ also has a unique limit, so the asymptotic Gibbs measure is unique. Notice also that, by
Talagrand's positivity principle, the distribution $\zeta_0$ is, actually, supported on $[0,1]$. Finally, in this
case one can show using the Aizenman-Sims-Starr scheme (44) that the limit of the free energy is
equal to $\mathcal{P}(\zeta_0)$, where we understand that the definition of the Parisi functional $\mathcal{P}(\zeta)$ is extended
to all distributions on $[0,1]$ by continuity. Thus, the infimum $\inf_\zeta \mathcal{P}(\zeta)$ in the Parisi formula (14)
is achieved on the asymptotic distribution of the overlap, $\zeta_0$. The functional $\mathcal{P}(\zeta)$ is conjectured
to be convex in $\zeta$ (see [23] for a partial result) and, if true, this would imply that $\zeta_0$ is the unique
minimizer of $\mathcal{P}(\zeta)$. Convexity of $\mathcal{P}(\zeta)$ would also give a more direct approach to describing the
high temperature region, which was done by Talagrand in [39] (see also [44]).
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